HETEROLOGOUS PROTEIN PRODUCTION SYSTEM

USING AVIAN CELLS 

BACKGROUND OF THE INVENTION 
1. Field of the Invention

The present invention relates to novel expression systems that
can produce biomedically important heterologous proteins, including
human erythropoietin (hereafter "EPO"), and more specifically to the
production of various heterologous proteins by transfecting DNA
encoding the proteins, such as the genomic DNA encoding EPO, into
avian cells.

2. Related Art

Many recombinant proteins used in medicine are relatively small
and simple in structure, and biologically functional forms can be
produced in prokaryotes such as E. coli. However, some human
proteins of medical interest, such as TPA (tissue plasminogen
activator), Factor VIII and EPO, are more complicated because their
biological function requires post-translational modification. For
example, EPO is extensively glycosylated, with the carbohydrate portion
accounting for 40 % of the molecular mass. It has been shown that
the carbohydrate portion of EPO is important for biological function.
Accordingly, EPO produced in E. coli, yeast or insect cells is inactive
or very weakly active in vivo, while EPO produced in COS or CHO cells
was found to be fully active. For this reason, such heterologous
proteins have been produced only in mammalian cells.

In the meantime, the avian system has long been used for the study
of gene expression in higher eukaryotes. One of the first viruses to
be linked to tumors was the Rous sarcoma virus (RSV) of chicken,
and this virus was instrumental in demonstrating that a retroviral
oncogene can originate from a cellular gene, leading to the concept of
the proto-oncogene. Studies of gene expression have also been done
using the RSV LTR promoter, which has often been used for high-level
expression of heterologous genes in mammalian cells. In addition,
avian embryo cells have been used extensively in studies of various
animal viruses.

SUMMARY OF THE INVENTION 

The present invention is directed to the high-level expression
of eukaryotic heterologous proteins. It is an object of the present
invention to provide a novel heterologous gene expression system
which can produce proteins of higher eukaryotic cells. It is another
object to provide a method of efficiently producing higher eukaryotic
proteins, such as EPO, which have been known to be active only
when produced in mammalian cells. It is a further object of
the invention to provide a method of producing EPO in particular
among the eukaryotic proteins described above.

To accomplish these objects, the present invention provides a
heterologous gene expression system comprising a DNA encoding a
heterologous protein, a vector for receiving the DNA, and an avian
cell for harboring the vector.

The present invention also provides a method of producing a 
heterologous protein comprising the steps of culturing the expression 
system of claim 1 in media to express the heterologous gene, and 
purifying the heterologous proteins from the cell and the media. 

Preferably, the heterologous protein of the present invention is
selected from the group consisting of those proteins that are known to
be active only when expressed in mammalian cells (such as EPO, TPA,
Factor VIII, etc.); preferably, the vector contains a promoter
selected from the group consisting of the SV40 early promoter, the
major immediate early promoter of human cytomegalovirus (hereafter
"HCMV MIEP") and the RSV LTR; and preferably, the avian cell is
selected from the group consisting of duck embryo cell (hereafter
"DE"), chicken embryo fibroblast (hereafter "CEF") and quail
fibrosarcoma (hereafter "QT"), more preferably QT-VC, which was
isolated by the inventors. QT-VC was deposited with the International
Depositary Authority, the Korean Collection for Type Cultures of the
Korea Research Institute of Bioscience and Biotechnology, and was
assigned deposit number KCTC 0277BP on August 22, 1996. The deposited
QT-VC was transfected with the expression vector containing SY-EPO
cDNA as described in Fig. 8.

More preferably, the DNA encoding the heterologous protein is 
genomic DNA or cDNA. 

Further, the present invention provides an EPO production 
system comprising a DNA encoding EPO, a vector for receiving the 
DNA, and an avian cell for harboring the vector. 

Moreover, the invention provides a method of producing EPO 
comprising the steps of inserting a DNA encoding EPO into a vector, 
transfecting the vector into an avian cell, and culturing the transfected 
avian cell in media. 

Preferably, the avian cell of the EPO production system is DE or 
QT, and the DNA is a genomic DNA encoding EPO, more preferably, 
the DNA selected from the group consisting of SY, JM, SH and HE 
described in Fig. 5. 

Preferably, the vector has a promoter selected from the group
consisting of the SV40 early promoter, the HCMV MIEP and the RSV LTR.

The present invention also provides an avian cell as a host for
expressing genes encoding mammalian proteins.

Further, the present invention provides a novel EPO genomic
sequence selected from the group consisting of SY, JM, SH and HE
described in Fig. 5, and also provides a novel EPO amino acid
sequence selected from the group consisting of JM, SH and HE
described in Fig. 6.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows the expression of the bacterial CAT gene in avian
cells. DE and CEF cells were transfected with pRc/CMV containing
(+) or lacking (-) the CAT sequence. CAT activity was measured by
determining the amount of acetylated chloramphenicol (AC) produced
from 14C-chloramphenicol. The values shown are from one
representative of more than five independent assays. For this
particular experiment, 10 µg of protein was reacted with
14C-chloramphenicol for 20 min at 37 °C.

Fig. 2 shows the comparison of CAT gene expression between
various cell types and between different promoters. The three
promoter-CAT fusion constructs were transfected into DE, CEF,
CHO-K1, and HeLa cells, and CAT activity was measured as described in
Fig. 1. S, SV40 early promoter; C, HCMV MIEP; R, RSV LTR. The
values shown are from one representative of three independent
assays. For this particular experiment, 10 µg of protein was reacted
with 14C-chloramphenicol for 30 min at 37 °C.

Fig. 3 shows the efficiency of DNA transfection in various cells.
The pCMV-lacZ construct was transfected into DE, CHO, Vero, HeLa,
and 293T cells by calcium phosphate-DNA coprecipitation using the
conditions used for the experiments shown in Fig. 2. Two days after
transfection, cells were fixed and stained with X-gal. The number of
blue cells per 60 mm tissue culture plate was counted. The total
number of cells was comparable between plates, at 1-3 x 10^5.
Transfection efficiency was calculated relative to DE cells.

Fig. 4 shows the schematic diagram for cloning of human EPO
and construction of expression vectors. The five blocks represent the
five coding regions of EPO. The first PCR was performed using
primers 25 and 33. The amplified DNA fragment was cloned and
subjected to a second PCR using primers 12 and 9. The wavy tail in
primer 12 contains the nucleotide sequence from the first coding region.
Therefore, the second PCR generates the entire coding sequence of
EPO, with the first and the second coding regions joined without an
intron between them. Primers 12 and 9 contain HindIII linkers at
their 5' ends, enabling cloning of the EPO genomic sequence into
various expression vectors.

Fig. 5 shows various EPO genomic DNA sequences. SY, SH, HE
and JM are the EPO genomic DNA sequences cloned in the present
invention, and AM and GI are the EPO genomic sequences which have
already been reported. Since the intron between the first coding
region and the second coding region was deleted during the cloning,
the deleted intron is not shown in Fig. 5.

Fig. 6 shows various EPO amino acid sequences. SY, SH, HE and
JM are the EPO amino acid sequences cloned in the present invention,
and AM and GI are the EPO amino acid sequences which have
already been reported. The abbreviations of the amino acids are as
follows:

A: alanine        R: arginine        N: asparagine      D: aspartic acid
C: cysteine       Q: glutamine       E: glutamic acid   G: glycine
H: histidine      I: isoleucine      L: leucine         K: lysine
M: methionine     F: phenylalanine   P: proline         S: serine
T: threonine      W: tryptophan      Y: tyrosine        V: valine

Fig. 7 shows the comparison of CAT gene expression between
QT-VC and mammalian cell lines. pCMV-CAT was transfected into
QT-VC, CHO-K1, and Vero cells, and CAT activity was measured as
described in Fig. 1. The transfection efficiency, as measured by X-gal
staining following cotransfection with pCMV-lacZ, was reproducibly
3-5 % in all cases. For this particular experiment, 50 µg of protein
was incubated with 14C-chloramphenicol for one hour at 37 °C.

Fig. 8 shows the typical structure of a plasmid used to express EPO
in QT cells. Two types of BamHI cassettes which could express
the gene for human glutamine synthetase (GS) were made. In these
BamHI cassettes, the GS cDNA sequence was flanked by the poly A
sequence from the bovine growth hormone gene and one of two
promoters, the partial MMTV LTR (from -220 to +15 from the RNA start
site) or the 220 bp HSV tk promoter. The BamHI fragment expressing
GS was inserted into the BamHI site of pCI-neo (Promega, Madison,
WI, USA), resulting in a series of pIGA plasmids. The HindIII fragment
of the SY-EPO cDNA sequence was cloned into the SmaI site of pIGA,
generating the EPO expression vector, pIGA-EPO.

Fig. 9 shows the production of EPO by QT-N4D4. QT-N4D4
cells were grown to confluence in a 10 cm culture dish (day 0) in M-199
containing 10 % FBS and 1 mM MSX. On day 3, the EPO level was
measured. The cells were then split 1:3 and seeded onto 10 cm
dishes. On day 6, the cells again reached confluence, and the
medium was replaced with 10 ml fresh medium containing 2 % (•) or
10 % (O) FBS. EPO levels were determined by ELISA (R&D Systems,
Minnesota, USA).

Fig. 10 shows the comparison of EPO concentration in DE (•)
and QT-N4D4 (O) measured by ELISA and by in vitro bioassay.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The inventors have explored the possibility of using avian cells
as host cells for heterologous gene expression. We chose three
avian cell types: two embryonic cell types, from chicken and duck,
and a quail fibrosarcoma line. We chose the chicken and duck
embryo cells for the following reasons. First, these embryonic
cells can easily be prepared from eggs, and they divide rapidly,
undergoing many passages. Second, chicken and duck cells can be
grown at large scale at relatively low cost. Third, some avian cells,
such as those from chicken embryos, have already been used for
medical products. For example, influenza virus has been cultured in
chicken eggs for the production of vaccines. Finally, the culture
conditions, including media and temperature, required by avian embryo
cells are virtually identical to those of mammalian cells, suggesting that
the physiology of avian and mammalian cells is probably comparable.

Further, a QT cell line was chosen because various transformed
cell lines have already been developed, these cell lines are easy to
handle when constructing a permanent cell line expressing a
heterologous protein, and their culture conditions and media are
similar to those of mammalian cells.

I. Cells and Plasmids 

1. Cells

Table 1 shows the cells used in the experiments.

Table 1

Cells                                            Source

HeLa human cervical carcinoma cells              ATCC CCL2
Vero African green monkey kidney cells           ATCC CCL81
COS-7 African green monkey kidney cells          ATCC CRL1651
  transformed by wild-type T antigen of SV40
CHO-K1 Chinese hamster ovary cells               ATCC CCL61
NIH3T3 contact-inhibited Swiss mouse             ATCC CRL1651
  embryo cells
Ad-5 transformed human embryonic                 ATCC CRL1651
  kidney cells 293
SL-29 chicken embryo fibroblast cells            ATCC CRL1590
Duck embryo                                      ATCC CCL141 or prepared
                                                   by the inventors
Quail fibrosarcoma line QT6                      ATCC CRL1708
Quail fibrosarcoma line QT-VC                    Isolated by the inventors
                                                   (KCTC 0277BP)



All these cells except the QT cell lines were grown in Dulbecco's
modified Eagle's medium (DMEM) supplemented with 10 % fetal bovine
serum (FBS). QT cell lines were cultured in M199 medium instead of
DMEM. Duck embryo cells were either obtained from ATCC (CCL141) or
prepared by trypsinization of 10- to 13-day-old decapitated duck
embryos. These avian cells were grown in minimum essential medium
(Eagle) supplemented with non-essential amino acids and Earle's
balanced salt solution containing 10 % FBS. These cells could be
maintained for approximately another 30 passages. Each medium
used in this study was supplemented with 120 µg/ml penicillin G
(Sigma P-3032; 1690 units per mg) and 200 µg/ml streptomycin (Sigma
S-9137; 750 units per mg).
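
The antibiotic supplements above are given by mass together with a
potency in units per mg, so they can be restated in activity units per
ml. The following is a minimal sketch of that arithmetic; the function
name is illustrative and not part of the original work.

```python
def units_per_ml(conc_ug_per_ml: float, potency_units_per_mg: float) -> float:
    """Convert a mass concentration (ug/ml) to activity units per ml."""
    mg_per_ml = conc_ug_per_ml / 1000.0
    return mg_per_ml * potency_units_per_mg


# Values as stated in the text for the two supplements.
penicillin = units_per_ml(120, 1690)    # penicillin G, Sigma P-3032
streptomycin = units_per_ml(200, 750)   # streptomycin, Sigma S-9137

print(f"penicillin G: {penicillin:.1f} units/ml")
print(f"streptomycin: {streptomycin:.1f} units/ml")
```

Under these figures the media would contain about 203 units/ml of
penicillin G and 150 units/ml of streptomycin.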

2. Plasmids

To evaluate the efficiency of heterologous protein production in
avian cells, pRc/RSV-CAT and pRc/CMV-CAT were constructed by
inserting a HindIII CAT cassette (Pharmacia, Piscataway, NJ) into the
HindIII sites of pRc/RSV and pRc/CMV (Invitrogen, San Diego,
California, USA), respectively. For pSVCAT, the plasmid p918 was
used, which has already been described by the inventors. For EPO
expression, three vectors were used. pCMV-gEPO was
constructed by cloning the HindIII fragment of the EPO genomic
sequence into the HindIII site of pRc/CMV. pSV-gEPO was derived by
replacing the CAT sequence of pSV918 with the genomic EPO
sequence. pIGA-EPO carries the EPO cDNA under the control of the
HCMV MIEP, together with the NEO and glutamine synthetase (hereafter
"GS") genes. To measure the transfection efficiency, the plasmid
pCMV-lacZ was constructed by inserting a bacterial lacZ fragment into
the HindIII site of pRc/CMV.

II. DNA Transfection and Gene Expression Assays 

The inventors tested whether avian embryo cells could be used
instead of mammalian cells for high-level heterologous gene
expression. Although avian embryo cells have been used to culture
viruses, there was no report that heterologous proteins of higher
eukaryotic cells had been expressed in these cells. To express
heterologous genes in avian cells, a method for efficiently
transfecting DNA into the target cells is required. We could
not find any reports on DNA transfection of avian embryo cells.
Accordingly, the inventors developed a technique by which CEF and
DE cells can readily be transfected with DNA.

Among the techniques available, we have chosen a method 
using calcium phosphate coprecipitation, because this works well for 
various adherent cells and can also be used for establishing permanent 
lines. We have tested many different conditions and found that the 
following procedure was optimum. 

When cultures were 50-70 % confluent in a 100 mm culture dish,
a total of 10 µg DNA in HBS buffer (140 mM NaCl, 5 mM KCl, 0.75 mM
Na2HPO4·2H2O, 6 mM dextrose, 25 mM HEPES) was incubated with the
cells for 30 min at room temperature. Then 10 ml of regular medium
containing FBS was added, and the cells were incubated for 20 hrs at
37 °C, except for CHO-K1 (8 hours). Cells were then treated with
10 ml of 100 nM chloroquine and incubated for another 3 hours at
37 °C. After replacement with 10 ml of fresh medium, the cells were
grown for 1 to 2 days. Culture supernatants were collected and
centrifuged at 1000 rpm for 10 min to remove cells and debris. To
measure transfection efficiency, cells were transfected with
pCMV-lacZ, rinsed once with PBS 3 days after transfection, fixed with
0.5 % glutaraldehyde (in PBS) for 10 min, and washed twice for 2-10
min each with 4 ml PBS containing 1 mM MgCl2. For X-gal staining, the
staining solution [PBS containing 4 mM K3Fe(CN)6, 4 mM
K4Fe(CN)6·3H2O, 2 mM MgCl2, and 400 µg per ml X-gal (in
dimethylformamide)] was added to the fixed cells, which were
incubated at 37 °C for 4 hours to overnight. When the reaction was
complete, the cells were washed once with PBS. Stained cells were
kept in PBS.
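
As an aid to preparing the HBS buffer listed above, the millimolar
concentrations can be converted into grams per liter. This is only a
sketch: the molar masses are standard reference values rather than
figures from the text, the helper name is illustrative, and the text
does not state the pH to which the buffer is adjusted.

```python
# Molar masses in g/mol (standard values; an assumption, not from the text).
MOLAR_MASS = {
    "NaCl": 58.44,
    "KCl": 74.55,
    "Na2HPO4.2H2O": 177.99,
    "dextrose": 180.16,
    "HEPES": 238.30,
}

# Final concentrations in mM, as listed for the HBS buffer.
HBS_MM = {
    "NaCl": 140,
    "KCl": 5,
    "Na2HPO4.2H2O": 0.75,
    "dextrose": 6,
    "HEPES": 25,
}


def grams_needed(volume_l: float) -> dict:
    """Return the mass in grams of each component for the given volume."""
    return {
        name: mm / 1000.0 * MOLAR_MASS[name] * volume_l
        for name, mm in HBS_MM.items()
    }


for name, grams in grams_needed(1.0).items():
    print(f"{name:>14}: {grams:.4f} g per liter")
```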

The CAT assay was carried out as follows:

Two to three days after transfection, cells were harvested,
washed once with PBS, and resuspended in 0.25 M Tris-HCl (pH 7.5).
Total proteins were prepared by 4 cycles of freeze/thawing followed by
heating at 65 °C for 7 min. Equivalent amounts of protein were
assayed for CAT activity at 37 °C for 30 min. The amount of protein
and the reaction time varied, depending on the experiment. For
example, the CAT activity of cell extracts prepared from DE cells was
so high that only 10 µg protein and a 20 to 30 min reaction time had
to be used, and under this condition, levels of CAT activity in
mammalian cells were very low or undetectable. When CAT activity
became detectable in other cells, virtually all 14C-chloramphenicol
was converted in the DE cell extracts. The percent conversion of
14C-chloramphenicol to its acetylated forms was determined by cutting
out regions containing unreacted and acetylated forms and quantifying
the amount of radioactivity in each by liquid scintillation counting.
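
The percent conversion described above is the acetylated counts
divided by the total counts recovered. A minimal sketch follows; the
counts in the example are hypothetical, and the 50 % cut-off is the
limit the text uses elsewhere for limiting reaction conditions.

```python
def percent_conversion(acetylated_cpm: float, unreacted_cpm: float) -> float:
    """Percent of 14C-chloramphenicol converted to acetylated forms."""
    total = acetylated_cpm + unreacted_cpm
    if total <= 0:
        raise ValueError("no radioactivity counted")
    return 100.0 * acetylated_cpm / total


def is_limiting(conversion_pct: float, limit_pct: float = 50.0) -> bool:
    """True when conversion is below the limit, so the assay is not saturated."""
    return conversion_pct < limit_pct


# Hypothetical scintillation counts, for illustration only.
pct = percent_conversion(acetylated_cpm=4500, unreacted_cpm=5500)
print(f"{pct:.1f} % converted, limiting conditions: {is_limiting(pct)}")
```

A sample that converts nearly all substrate cannot be compared
quantitatively with others, which is why the amount of protein and the
reaction time were reduced for DE extracts.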

III. Gene Expression in DE and CEF 

Gene expression efficiency in DE and CEF cells was measured using
the CAT gene. We initially chose the promoter from the major
immediate-early region of HCMV, because it has been shown to
drive high-level gene expression in many different cell types. In the
plasmid pCMV-CAT, the bacterial CAT gene is placed under the control
of the HCMV MIEP. As a negative control, the plasmid pRc/CMV,
containing the promoter but no CAT sequence, was used. These
plasmids were transfected into DE and CEF cells, and CAT activity was
measured to estimate the efficiency of transfection and gene
expression. One representative result from several independent
transfections is shown in Fig. 1. Transfection of the control plasmid
resulted in undetectable levels of CAT activity in both cell types.
However, transfection with pCMV-CAT resulted in readily detectable
levels of CAT activity in both. In more than five independent
transfection assays, the level of CAT activity was always higher in DE
cells than in CEF cells. The difference in CAT activity between the
two cell types ranged from 10- to 50-fold, depending on the
experiment. This result indicated that avian cells were readily
transfected with DNA and that heterologous genes could be efficiently
expressed in them.

IV. Comparison of Levels of Gene Expression between Avian 
and Mammalian Cells, and between Different Promoters 

We have compared the levels of gene expression between avian
and mammalian cells, using three different promoters:

(1) the SV40 early promoter, which is used during the early
transcriptional phase of SV40 infection;

(2) the HCMV MIEP, which drives the expression of the IE1 and IE2
regulatory proteins immediately after HCMV infection;

(3) the RSV LTR from an avian retrovirus. 

These promoters are known to be powerful in mammalian cells, 
and have often been used for high level heterologous gene expression. 

These promoter-CAT fusion constructs were transfected into
four different cell types, DE, CEF, CHO-K1, and HeLa, and CAT activity
was measured to compare the efficiency of gene expression between
promoters and between cell types. To make this comparison
semi-quantitative, all transfections and CAT assays were performed at
the same time and under identical conditions. One representative
result of such experiments is shown in Fig. 2. Here, 10 µg of cell
extract was incubated for 30 min in the CAT reaction. Under these
particular conditions, the levels of CAT expression driven from the three
promoters were very low in CHO and HeLa cells (Fig. 2). CAT activity
was detected only when larger amounts of protein were used with an
extended reaction time. In contrast, CAT activity was readily
detectable in the avian cells (Fig. 2), except for the SV40 promoter
in CEF cells. This indicated that expression in avian cells is more
efficient than in mammalian cells.

The most dramatic finding was that the HCMV MIEP was
extremely powerful in DE cells. In Fig. 2, the conditions used for the
CAT reaction were chosen to generate reasonable levels of CAT
activity in the other samples. When the CAT reaction was performed
under limiting conditions for the protein sample prepared from DE cells
transfected with pCMV-CAT (i.e., when the CAT conversion was below
50 %), the levels of CAT activity of all the other samples were virtually
undetectable. Therefore, the difference in CAT activity
between the protein sample from DE cells transfected with pCMV-CAT
and those from the other transfections is at least two orders of
magnitude. These results suggested that heterologous genes could
be expressed very efficiently under the control of the HCMV MIEP in
DE cells.

It is possible that the high levels of CAT expression seen in DE
cells could be due to efficient transfection of the cell population, rather
than an ability of these cells to support strong gene expression. To
distinguish these possibilities, we transfected pCMV-lacZ into DE and
various animal cells. After transfection, cells were stained with X-gal,
and the number of blue cells was counted to estimate the transfection
efficiency. As shown in Fig. 3, the number of stained cells was always
comparable between DE and other animal cells, suggesting that the
high levels of CAT expression in DE cells were due to high levels of
expression in individual cells.

WO 97/08307 PCT/KR96/00145

V. Cloning of human erythropoietin 

To test whether DE cells could indeed be used for the 
expression of medically important human proteins, we have isolated the 
genomic DNA encoding the human EPO gene. We chose to use EPO 
as a model because it is a secreted protein, so we could test whether 
DE cells properly process secreted proteins. We also used a genomic 
clone of EPO instead of the cDNA, to assess whether human genes are 
properly spliced to produce functional mRNAs in DE cells. 

DNAs for cloning of EPO were prepared from blood cells
collected from four people. Human peripheral blood lymphocytes
were isolated by Ficoll-Hypaque gradient centrifugation of
heparin-treated blood cells. Total DNA was prepared and used for
polymerase chain reaction using specific oligonucleotide primers (Fig.
4). The region around the start codon was highly GC rich, so the EPO
sequence was cloned by two steps of PCR using two different pairs of
primers.

To obtain the genomic DNA for EPO, total DNA was prepared by
lysing human peripheral blood lymphocytes using TES (10 mM Tris-HCl
pH 7.8; 1 mM EDTA; 0.7 % SDS) followed by treatment with
400 µg/ml proteinase K at 50 °C for 1 hour, phenol:chloroform
extraction, and ethanol precipitation. The polymerase chain reaction
(PCR) was performed using 0.1 µg of total genomic DNA and
oligonucleotide primers specific to the EPO gene.

Primer #25 (sense, 5' to 3'): GAAGCTGATAAGCTGATAACC
Primer #33 (antisense, 5' to 3'): TGTGACATCCTTAGATCTCA

The samples were amplified through 30 cycles with the
following parameters: denaturation at 92 °C for 1 min, primer annealing
at 55 °C for 1 min, and primer extension at 72 °C for 1 min. The DNA
fragment amplified from this reaction did not contain the first 13
nucleotides of the N-terminal coding region, so a second PCR was
performed using the following primers (Underlined, HindIII; Outlined,
start and stop codons, respectively). The relative positions of these
primers are as shown in Fig. 4. Taq DNA polymerase (POSCO Chem, Korea)
and Pfu polymerase (Stratagene, California, USA) were used to
amplify DNA.

Primer #12 (sense, 5' to 3'):
CAAGCTTCGGAGATGGGGTGCACGAATGTCCTGCCTGGCTGTGGC

Primer #9 (antisense, 5' to 3'):
CAAGCTTTCATCTGTCCCCTGTCCTGC
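
As a rough plausibility check on the first-round PCR described above,
the melting temperatures of primers #25 and #33 can be estimated and
compared with the 55 °C annealing step. The sketch below uses the
Wallace rule, Tm = 2(A+T) + 4(G+C), which is a common rule of thumb
and not a method named in the text; the function names are
illustrative. It also sums the programmed hold times for 30 cycles,
ignoring temperature ramping.

```python
def wallace_tm(primer: str) -> int:
    """Estimate primer melting temperature (deg C) with the Wallace rule."""
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def hold_time_min(cycles: int, denature: float = 1.0,
                  anneal: float = 1.0, extend: float = 1.0) -> float:
    """Total programmed hold time in minutes, excluding ramp times."""
    return cycles * (denature + anneal + extend)


# First-round primers as given in the text.
PRIMER_25 = "GAAGCTGATAAGCTGATAACC"
PRIMER_33 = "TGTGACATCCTTAGATCTCA"

print("primer #25 Tm estimate:", wallace_tm(PRIMER_25))
print("primer #33 Tm estimate:", wallace_tm(PRIMER_33))
print("hold time for 30 cycles (min):", hold_time_min(30))
```

Both estimates fall a few degrees above the 55 °C annealing
temperature, which is the usual relationship between the two.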

The amplified DNA from the second PCR was cloned into
pCRII (Invitrogen), from which the HindIII fragment containing the
genomic sequence of EPO was inserted into various expression
vectors as described above. In this experiment, the amplified DNA
was placed under the control of the HCMV MIEP or the SV40 early
promoter, generating pCMV-gEPO and pSV-gEPO, respectively.
SY-EPO, whose amino acid sequence is identical to that of the already
known EPO, was used for the expression experiments in sections VII
and VIII (see section VI).

VI. Analysis of Nucleotide Sequences of Cloned EPO Genomes 

The genomic structure of EPO cloned by the above method is different
from the natural EPO genome in vivo. That is, wild-type EPO genomic
DNA has five coding regions and four introns between them. However, in
the DNA cloned by the above method, the first coding region was fused
to the second coding region to form one coding region, so that it has
four coding regions and three introns (Fig. 4).

Analysis of the EPO gene sequences isolated from four people
showed that the nucleotide sequences of the EPO genes cloned from
these individuals differ significantly, within the introns, from those
of the two previously reported EPO genes (AM-EPO and GI-EPO) (Fig. 5).
This difference was not due to errors introduced during DNA
amplification in the process of cloning. We repeated cloning and
sequencing using DNAs prepared from the same individuals (but at
different times) and obtained the same nucleotide sequences. As
another control, we amplified the already cloned EPO under similar
conditions and determined the nucleotide sequence. Again, we
obtained the same nucleotide sequence.

The amino acid sequences of the four EPO genes, together with AM and
GI, are shown in Fig. 6. The amino acid sequences of AM, GI and SY are
identical. However, the amino acid sequences from three people (JM, SH,
HE) differed by two or three amino acids from GI- and AM-EPO,
suggesting that there is polymorphism among people. When
compared with AM- or GI-EPO, HE-EPO had three different amino
acids at the C-terminal, SH-EPO three different amino acids over the
whole polypeptide, and JM-EPO two different amino acids, one at the
C-terminal and the other in the middle of the polypeptide (see Fig. 6).
For example, while AM-EPO and GI-EPO had serine, alanine, and valine
at positions 36, 100 and 170 respectively, SH-EPO had arginine, serine,
and tyrosine. Further, while AM-EPO and GI-EPO had valine, lysine, and
arginine at positions 170, 177, and 191, HE-EPO had tyrosine,
glutamine, and glycine. In JM-EPO, lysine and tyrosine were present
at positions 54 and 170, whereas AM-EPO and GI-EPO had threonine and
valine at these positions. These results suggested that the EPO gene
is polymorphic in amino acid sequence as well as in DNA sequence.
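
The amino acid differences listed above can be tabulated against the
AM-/GI-EPO residues to make the pattern easier to see. The sketch
below encodes only the positions and residues named in the text; the
"S36R"-style notation and the variable names are conventions adopted
here for illustration, not ones used in the original.

```python
# Residues of AM-EPO and GI-EPO at the positions discussed in the text.
REFERENCE = {36: "S", 54: "T", 100: "A", 170: "V", 177: "K", 191: "R"}

# Residues that differ from the reference in each cloned sequence.
VARIANTS = {
    "SY": {},
    "SH": {36: "R", 100: "S", 170: "Y"},
    "HE": {170: "Y", 177: "Q", 191: "G"},
    "JM": {54: "K", 170: "Y"},
}


def substitutions(name: str) -> list:
    """List substitutions as reference residue, position, variant residue."""
    return [
        f"{REFERENCE[pos]}{pos}{aa}"
        for pos, aa in sorted(VARIANTS[name].items())
    ]


# Positions changed in every sequence that differs from the reference.
changed = [set(v) for v in VARIANTS.values() if v]
shared_positions = set.intersection(*changed)

for name in VARIANTS:
    print(name, substitutions(name) or "identical to AM/GI")
print("position(s) changed in all variant sequences:", sorted(shared_positions))
```

Laid out this way, the valine-to-tyrosine change at position 170 is the
one substitution common to SH, HE and JM.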

VII. Expression of EPO in DE Cells 

In this experiment, we compared levels of EPO expression 
between DE cells and other cell lines. 

EPO expression vectors were transfected into various cells
including DE, CEF, CHO, HeLa, VERO, and 293T. We included
VERO cells because they are often used for heterologous gene
expression, and 293T cells because they drive very high levels of gene
expression, presumably due to both the high frequency of DNA
transfection and the presence of potent viral transactivators such as
E1A, E1B, and large T antigen. Two to three days after transfection,
levels of EPO in the culture supernatants were measured by
enzyme-linked immunosorbent assay (ELISA), and transfection
efficiencies were determined by staining the cells adhering to the
culture dish with X-gal. Transfection efficiency was determined by
cotransfection of a lacZ expression vector together with an EPO
expression vector as described in section II. One representative
result of this analysis is summarized in Table 2.



Table 2

Cell       HCMV MIEP    SV40 early promoter    HCMV/SV40

293        314          17.5                   18
CHO        139.4        10.4                   13.5
VERO       250          10.7                   23.5
NIH3T3     89           79.4                   1.1
DE         4335         13.8                   314.8

When the SV40 early promoter was used, there was little
difference in the levels of EPO between cell types. However, when
the HCMV MIEP was used, DE cells produced much higher levels of
EPO than any other cell line tested. The HCMV MIEP was much
more active than the SV40 early promoter in almost all the cells tested.
This difference was especially pronounced in DE cells, where the
former produced 315 times more EPO than the latter. Among the
various cell types, DE cells always produced the highest level of EPO.

CHO cells are the source of cell lines producing the EPO that is
currently used for human application. In this transient system,
however, the level of EPO in CHO cells was at least 30-fold lower than
in DE cells. The difference between DE and 293T cells was also
considerable. The transfection efficiency of 293T was about 30-fold
higher than that of any other cells, including DE cells. Moreover,
293T cells produce potent viral transcription transactivators.
Nevertheless, DE produced 10-fold more EPO than 293T, suggesting that
DE could drive high levels of gene expression.
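
The fold differences quoted above can be recomputed from the two
measured columns of Table 2. This is a minimal sketch; the dictionary
and function names are illustrative, and the table does not state the
units of the EPO levels, so only ratios are calculated.

```python
# (HCMV MIEP, SV40 early promoter) EPO levels from Table 2.
TABLE_2 = {
    "293": (314, 17.5),
    "CHO": (139.4, 10.4),
    "VERO": (250, 10.7),
    "NIH3T3": (89, 79.4),
    "DE": (4335, 13.8),
}


def promoter_ratio(cell: str) -> float:
    """HCMV MIEP level divided by SV40 early promoter level in one cell type."""
    hcmv, sv40 = TABLE_2[cell]
    return hcmv / sv40


def fold_below_de(cell: str) -> float:
    """How many times higher the HCMV MIEP level is in DE than in `cell`."""
    return TABLE_2["DE"][0] / TABLE_2[cell][0]


for cell in TABLE_2:
    print(f"{cell:>7}: HCMV/SV40 = {promoter_ratio(cell):.1f}")
print(f"DE vs CHO: {fold_below_de('CHO'):.1f}-fold")
print(f"DE vs 293: {fold_below_de('293'):.1f}-fold")
```

The computed DE ratio of about 314 agrees with the roughly 315-fold
difference stated in the text, and the DE-to-CHO and DE-to-293
comparisons come out above 30-fold and 10-fold respectively.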

In conclusion, human EPO could be efficiently produced and
secreted in DE cells, and the HCMV MIEP is the promoter of choice
for driving high-level heterologous gene expression in DE cells.

In summary, we found that DE cells could produce very high 
levels of bacterial and human proteins. All three promoters tested 
drove higher levels of gene expression in DE cells than any other cell 
lines used in this study. In particular, the HCMV MIEP was extremely 
powerful in DE cells. The high level of heterologous gene expression 
observed was not due to a higher number of transfected cells. It 
appears that DE cells properly process splicing and secretion because 
transfection of DE cells with an expression vector containing the EPO 
genomic DNA sequence produced a large quantity of EPO in the 
culture supernatant. 

For DE cells to be used for industrial purposes, one would need to 
develop large-scale culture techniques for these cells. There are two 
possible ways. First, it may be possible to use the primary cells 
themselves as the producer line. A large number of DE cells can 
easily be prepared from 10- to 13-day-old duck embryos. From one 
embryo, we can readily obtain 10^9 to 10^10 cells that can undergo at least 
15 passages. Therefore, it is possible to transfect DE cells at the 
earliest possible stage with an expression vector, followed by selection 
of transfected cells, which might require 4-7 passages. Even if less 
than 5% of the cells were transfected, a large number of transfected 
cells would be available, suggesting that large-scale culture of primary 
duck embryo cells is feasible. Second, it may 
be possible to transform duck embryo cells at an early stage, using one 
of the large number of well-characterized oncogenes that are available. 
With transformed DE cells, a producer line could be constructed, and 
better quality control of protein production could be established. It remains 
to be seen whether transformed DE cells will still maintain the capability 
for high-level gene expression. Although a number of biological 
questions remain to be answered, the potential of these cells for the 
production of various proteins warrants further investigation. 
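
The cell-number argument for the primary-cell approach can be checked with simple arithmetic. The sketch below is illustrative only; the function name is hypothetical, and the inputs are the per-embryo cell yields and the 5% transfected fraction mentioned in the text.

```python
def transfected_cells(cells_per_embryo: float, fraction_transfected: float) -> float:
    """Number of transfected cells obtained from one embryo.

    fraction_transfected is a fraction between 0 and 1 (0.05 means 5%).
    """
    if not 0.0 <= fraction_transfected <= 1.0:
        raise ValueError("fraction_transfected must be between 0 and 1")
    return cells_per_embryo * fraction_transfected


# Range quoted in the text: 10^9 to 10^10 cells per embryo, 5% transfected.
for cells in (1e9, 1e10):
    n = transfected_cells(cells, 0.05)
    print(f"{cells:.0e} cells per embryo -> {n:.1e} transfected cells")
```

Even at the lower end of the range, this gives on the order of 10^7 to 10^8 transfected cells per embryo before any expansion during selection.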

VIII. Heterologous Gene Expression 
in the Transformed Avian Cell Line 

The above experiments demonstrated the great potential of DE 
cells as producers of heterologous proteins such as EPO. However, 
DE cells used in the above experiments are primary cells and stop 
dividing after 30-40 passages in vitro. Therefore, unless DE cells are 
transformed or special techniques are developed as described above, it 
is difficult to use these embryonic cells for industrial production of 
heterologous proteins. 

In the following study, we tested whether a transformed avian 
cell line, namely a quail fibrosarcoma line, could be used to produce 
EPO. The quail fibrosarcoma line used in this study, QT-VC, was 
subcloned from QT6 (ATCC CRL1708). This line was derived from a 
methylcholanthrene-induced fibrosarcoma of Japanese quail. QT-VC 
differs from its parental line in at least two respects. First, QT-VC 
grows faster than the parental line in the M199 medium containing 10% 
FBS used in this study. The former divided every 12-24 hours, while 
the doubling time of the latter was 24-36 hours. Second, QT-VC 
cells are rounder than QT6 cells, which generally grow in an elongated 
form. Like its parental line, QT-VC did not grow well when it was 
seeded at a low density. Therefore, cells had to be split to 1/3 to 1/2 
after reaching confluence for continuous culture. 
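
The doubling times and split ratios given above imply a rough passage schedule. The following sketch estimates the time needed to regain confluence after a split; it assumes ideal exponential growth with no lag phase and no contact inhibition before confluence, a simplification introduced here for illustration, and the function name is hypothetical.

```python
import math


def hours_to_confluence(split_fraction: float, doubling_time_h: float) -> float:
    """Hours for a culture split to `split_fraction` of confluence to regrow.

    split_fraction: fraction of a confluent culture that is reseeded (e.g. 1/3).
    doubling_time_h: population doubling time in hours.
    Assumes pure exponential growth (illustration only).
    """
    if not 0.0 < split_fraction <= 1.0:
        raise ValueError("split_fraction must be in (0, 1]")
    if doubling_time_h <= 0:
        raise ValueError("doubling_time_h must be positive")
    return doubling_time_h * math.log2(1.0 / split_fraction)


# QT-VC values from the text: doubling every 12-24 h, split to 1/3 or 1/2.
for fraction in (1 / 3, 1 / 2):
    fast = hours_to_confluence(fraction, 12.0)
    slow = hours_to_confluence(fraction, 24.0)
    print(f"split to {fraction:.2f}: about {fast:.0f} to {slow:.0f} h to confluence")
```

On these assumptions, QT-VC cultures would need passaging roughly every one to two days.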

1. Analysis of Gene Expression in QT-VC Cells 

We compared the levels of gene expression between QT-VC 
and mammalian cells using pCMV-CAT. We chose the HCMV 
MIEP because this promoter was shown to drive high levels of gene 
expression in various cell types, including avian cells (see Section 
IV). pCMV-CAT was transfected into three cell lines: QT-VC, CHO-K1 and 
Vero. To make this comparison semi-quantitative, all transfections 
and CAT assays were performed at the same time and under identical 
conditions. Transfection efficiency was also measured by 
cotransfecting pCM-lacZ followed by X-gal staining. The efficiency 
was approximately 3% in all cases. Under these conditions, the 
levels of CAT expression in QT-VC cells were always 2-3 times higher 
than in the mammalian cell lines used in this study (Fig. 7). Although the 
level of gene expression in QT-VC cells appears to be lower than in DE 
cells, the quail fibrosarcoma line is at least as good as the mammalian cell 
lines, suggesting that it could be used as a producer of heterologous 
proteins. 

2. Construction of EPO Expression Vectors for QT-VC Cells 

To test whether high levels of heterologous proteins could be 
expressed in QT cells, we constructed additional EPO expression 
vectors. The basic strategy for the construction of an expression 
vector was as follows: 

First, we chose to use the HCMV MIEP to drive expression of the 
heterologous gene as it had already been shown to be one of the 
strongest promoters in avian cells as well as mammalian cells. 

Second, the human glutamine synthetase (GS) gene was used 
for amplification of the target gene. Generally, the gene of interest is 
amplified to augment the yield of protein by using certain selectable 
markers in the presence of specific chemicals. One of the best 
examples is the dihydrofolate reductase (DHFR) gene. It has been 
shown that the copy number of the heterologous gene and the level of 
the respective protein increase as the concentration of methotrexate (MTX) 
in the medium is slowly increased. However, this system requires a 
host cell defective in the DHFR gene, so it cannot be directly applied to 
QT cells, for which such a mutant line is not yet available. For this 
reason, we chose to use the GS gene. In this case, the host cell line 
need not be deficient in GS, because only multiple copies of the GS 
gene can confer resistance to methionine sulfoximine (MSX). 

The overall structure of the EPO expression vectors constructed for 
use in QT cells is shown in Fig. 8. In this structure, the cDNA 
sequence for EPO is under the control of the HCMV MIEP, the bacterial 
Neo gene is used as the first selectable marker, and the human GS 
gene is also present as the second selectable marker in the same 
plasmid. The backbone of the expression vectors used in this particular 
experiment was pCI-neo (Promega, USA), which uses the HCMV MIEP 
and the intron from the β-globin gene. We have made a couple of 
different constructs in which the GS gene is driven by either the partial MMTV 
LTR (from -220 to +15) or the 220 bp HSV tk promoter. In either 
case, the magnitude of gene amplification appears to be comparable 
(data not shown). Detailed procedures and supplementary data 
regarding the construction of the expression vectors are available upon 
request. 

3. Construction of QT-VC Cells Stably Expressing EPO 

To construct QT-VC cell lines constitutively expressing EPO, the 
cells were transfected with an EPO expression vector by the calcium 
phosphate coprecipitation method described in Section II. 
Three days after transfection, EPO production was confirmed by ELISA 
and the transfected cells were treated with G418 (0.8 mg/ml) and MSX (25 
µM). When G418-resistant cells had grown to confluence, the cells were 
diluted for subcloning. Because QT-VC cells do not grow efficiently at 
a low cell density, cells were seeded on 10-cm culture dishes at various 
numbers (10^2, 10^3, 10^4 or 10^5 per dish). Colonies that grew 
distant from other colonies were then isolated with plastic O-rings and 
expanded onto a 96-well plate. When the cells reached 70% confluence, 
the EPO level was measured. Subclones that produced more than 
200 U/ml were serially expanded from 12-well to 6-well to 60 mm 
culture plates. When the cells reached confluence on a 60 mm dish, they 
were split onto 6-well plates and then treated with various concentrations 
of MSX (100 nM, 250 nM, 1 mM). Using this procedure, several 
subclones that produced large amounts of EPO and also grew fast 
were selected. 
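
The screening step in this procedure amounts to a threshold filter on the measured EPO levels. A minimal sketch is given below; the clone names and EPO values are hypothetical placeholders and are not data from this study. Only the 200 U/ml cutoff is taken from the text.

```python
from typing import Dict, List


def select_subclones(epo_u_per_ml: Dict[str, float], threshold: float = 200.0) -> List[str]:
    """Return names of subclones whose EPO level exceeds `threshold`.

    The result is ordered from the highest to the lowest producer.
    """
    keep = [(level, name) for name, level in epo_u_per_ml.items() if level > threshold]
    keep.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in keep]


# Hypothetical ELISA readings (U/ml) for illustration only.
readings = {"clone-1": 150.0, "clone-2": 250.0, "clone-3": 900.0}
print(select_subclones(readings))
```

Subclones passing the filter would then be expanded and exposed to higher MSX concentrations as described above.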

One of the subclones obtained through this procedure is QT-N4D4. 
As shown in Fig. 9, this subclone produced 1200 U/ml when 
grown for 3 days after confluence. When the cells were split 1:3, 
seeded on 10 cm dishes, and allowed to grow for another 3 days, N4D4 
still produced 1000 U/ml. The medium was then replaced with fresh 
medium containing 2% FBS, and the cells still produced 400 U/ml EPO. 
These results indicated that QT cells could produce a large quantity of 
EPO. 

In conclusion, the above experiment demonstrated the great 
potential of QT cells as producers of heterologous proteins. 

IX. Biological Activity of EPO Produced in Avian Cells 

EPO is heavily glycosylated and such glycosylation is required 
for its biological activity. For example, EPO produced in E. coli or 
yeast is inactive or very weakly active in vivo. To test whether EPO 
expressed in DE or QT cells was biologically active, we carried out an in 
vitro bioassay using spleen cells isolated from mice treated with 
phenylhydrazine. 

EPO assay: Absolute levels of EPO production after 
transfection of various cells were determined by an enzyme-linked 
immunosorbent assay (ELISA) that is currently used to measure EPO 
levels in human serum (R&D Systems Inc., Minnesota, USA). To 
measure the biological activity of EPO, an in vitro bioassay was carried out 
by the method of Krystal as modified by Goldberg et al. Spleen cells 
were taken from C57BL × C3H F1 hybrid mice (Seoul National 
University Laboratory Animal Center) on day 3 after the second of two 
daily injections of phenylhydrazine (60 mg/kg of body weight per day), 
and spleen cell suspensions were prepared with Lymphoprep™ 
(NYCOMED PHARMA AS, Oslo, Norway). The spleen cells (final 
concentration 4 × 10^6 cells per ml) were then incubated in 24-well tissue 
culture plates with various standard doses of EPO (CILAG AG 
International, Switzerland; specific activity 2000 U/ml) or unknown 
samples for 22 hr and then pulsed with 4 µCi/well tritiated thymidine 
(Amersham Co.) for 2-3 hr. The cells were harvested, washed with 
PBS several times and lysed with 0.3 N NaOH and 0.1% SDS. 
Radioactivity in LSC cocktail solutions was counted with a Pharmacia 
Wallac 1410 scintillation counter. 
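
Reading an unknown sample against the standard doses can be illustrated as follows. This is a minimal sketch and not the calculation prescribed by the cited method: it assumes that counts rise monotonically with dose and interpolates linearly in counts and logarithmically in dose between the two bracketing standards. The function name and the standard-curve values are hypothetical.

```python
import math
from bisect import bisect_left
from typing import List, Tuple


def interpolate_dose(standards: List[Tuple[float, float]], cpm: float) -> float:
    """Estimate an EPO dose from a thymidine-incorporation count.

    standards: (dose, cpm) pairs for the EPO standard; doses must be positive
               and cpm is assumed to increase with dose.
    cpm: count measured for the unknown sample.
    """
    points = sorted(standards, key=lambda p: p[1])
    counts = [c for _, c in points]
    if cpm < counts[0] or cpm > counts[-1]:
        raise ValueError("count lies outside the range of the standards")
    i = bisect_left(counts, cpm)
    if counts[i] == cpm:
        return points[i][0]
    (d0, c0), (d1, c1) = points[i - 1], points[i]
    frac = (cpm - c0) / (c1 - c0)
    log_dose = math.log10(d0) + frac * (math.log10(d1) - math.log10(d0))
    return 10 ** log_dose


# Hypothetical standard curve (dose in mU/ml, counts per minute).
curve = [(10.0, 1000.0), (100.0, 3000.0), (1000.0, 5000.0)]
print(round(interpolate_dose(curve, 2000.0), 1))
```

Samples whose counts fall outside the standard range are rejected rather than extrapolated; in practice such samples would be diluted and measured again.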

Culture supernatants from QT-N4D4 cells or DE cells transfected 
with EPO expression vectors were taken to measure levels of EPO by 
both ELISA and the bioassay. The ELISA measures absolute 
concentration, and is currently used for determining EPO concentration 
in human serum. On the other hand, the bioassay determines 
biological activity using a control EPO that has been produced from 
mammalian cells and is currently being used in humans. Fig. 10 
compares the levels of EPO determined by these two 
methods. The ratio between the values (mU) was 1 ± 0.15, and the 
specific activity of EPO produced from DE cells was estimated to be 105 
U/µg. Therefore, the levels of EPO measured by ELISA were 
comparable to those obtained by the bioassay. This result suggested 
that EPO produced from these avian cells had a biological 
activity similar to that of commercially available EPO. 
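
The ratio reported above can be computed from paired measurements as sketched below. The function name and the paired values are hypothetical and serve only to show the arithmetic; whether the reported spread is a standard deviation is not stated in the text and is assumed here.

```python
from statistics import mean, stdev
from typing import List, Tuple


def activity_ratio(elisa_mU: List[float], bioassay_mU: List[float]) -> Tuple[float, float]:
    """Mean and sample standard deviation of bioassay/ELISA ratios.

    Both lists hold EPO levels (mU) for the same samples, in the same order.
    """
    if len(elisa_mU) != len(bioassay_mU):
        raise ValueError("the two lists must have the same length")
    if len(elisa_mU) < 2:
        raise ValueError("at least two paired samples are required")
    ratios = [b / e for e, b in zip(elisa_mU, bioassay_mU)]
    return mean(ratios), stdev(ratios)


# Hypothetical paired readings for illustration only.
elisa = [100.0, 200.0, 400.0]
bioassay = [90.0, 200.0, 440.0]
m, s = activity_ratio(elisa, bioassay)
print(f"bioassay/ELISA ratio: {m:.2f} ± {s:.2f}")
```

A mean ratio close to 1 indicates that the immunologically detected EPO is fully accounted for by biologically active protein.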

What is claimed is: 

1. A heterologous gene expression system comprising: 
a DNA encoding a heterologous protein; 

a vector for receiving the DNA; and 
an avian cell for harboring the vector. 

2. The expression system of claim 1, wherein the heterologous 
protein is selected from the group consisting of TPA, Factor VIII and 
EPO. 

3. The expression system of claim 1, wherein the vector 
contains a promoter selected from the group consisting of SV40 early 
promoter, HCMV MIEP and RSV LTR. 

4. The expression system of claim 1, wherein the avian cell is 
selected from the group consisting of DE, CEF and QT. 

5. The expression system of claim 4, wherein the QT is QT-VC. 

6. The expression system of claim 1, wherein the DNA 
encoding the heterologous protein is DNA or cDNA. 

7. An avian cell as a host for expressing a gene encoding a 
mammalian heterologous protein. 

8. A method of producing a heterologous protein comprising 
the steps of: 

culturing the avian cell containing the expression system of claim 
1 to express the gene of the heterologous protein in media; and 

purifying the heterologous protein from the cell and the media. 

9. The method of claim 8, wherein the heterologous protein is 
selected from the group consisting of TPA, Factor VIII and EPO. 



WO 97/08307    PCT/KR96/00145 



10. The method of claim 8, wherein the vector contains a 
promoter selected from the group consisting of SV40 early promoter, 
HCMV MIEP and RSV LTR. 

11. The method of claim 8, wherein the avian cell is selected 
from the group consisting of DE, CEF and QT. 

12. The method of claim 11, wherein the QT is QT-VC. 

13. The method of claim 8, wherein the DNA encoding the 
heterologous protein is DNA or cDNA. 

14. An EPO production system comprising: 
a DNA encoding EPO; 

a vector for receiving the DNA; and 
an avian cell for harboring the vector. 

15. The EPO production system of claim 14, wherein the avian 
cell is DE or QT. 

16. The EPO production system of claim 15, wherein the QT is 
QT-VC. 

17. The EPO production system of claim 14, wherein the DNA 
is a genomic DNA encoding EPO. 

18. The EPO production system of claim 14, wherein the DNA 
encoding EPO is selected from the group consisting of SY, JM, SH and 
HE described in Fig. 5. 

19. The production system of claim 14, wherein the vector 
contains a promoter selected from the group consisting of SV40 early 
promoter, HCMV MIEP and RSV LTR. 

20. A method of producing EPO comprising the steps of: 
inserting a DNA encoding an EPO into a vector; 
transfecting the vector into an avian cell; and 
culturing the transfected avian cell in media. 

21. The method of claim 20, wherein the avian cell is DE or QT. 

22. The method of claim 21, wherein the QT is QT-VC. 

23. The method of claim 20, wherein the DNA encoding EPO is 
a genomic DNA. 

24. The method of claim 20, wherein the DNA encoding the 
EPO is selected from the group consisting of SY, JM, SH and HE 
described in Fig. 5. 

25. The method of claim 20, wherein the vector contains a 
promoter selected from the group consisting of SV40 early promoter, 
RSV LTR and HCMV MIEP. 

26. An EPO genomic sequence selected from the group 
consisting of SY, JM, SH and HE described in Fig. 5. 

27. An EPO amino acid sequence selected from the group 
consisting of JM, SH and HE described in Fig. 6. 

FIG. 5A 

AM  ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT  50 
GI  ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT  50 
SY  ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT  50 
JM  ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT  50 
SH  ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT  50 
HE  ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCT  50 
    ************************************************** 

AM  GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT  100 
GI  GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT  100 
SY  GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT  100 
JM  GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT  100 
SH  GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT  100 
HE  GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT  100 
    ************************************************** 

[Alignment block ending at position 150: rows not legible in the source scan.] 

AM  AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC  200 
GI  AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC  200 
SY  AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC  200 
JM  AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC  200 
SH  AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC  200 
HE  AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC  200 
    ************************************************** 

FIG. 5B 



[Alignment block ending at positions 248 (AM, GI, SY, JM, SH) and 250 (HE): rows not legible in the source scan.] 

AM  GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT  298 
GI  GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT  298 
SY  GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT  298 
JM  GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT  298 
SH  GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT  298 
HE  GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT  300 
    ************************************************** 

AM  GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA  348 
GI  GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA  348 
SY  GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA  348 
JM  GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA  348 
SH  GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA  348 
HE  GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA  350 
    ************************************************** 

[Alignment block ending at positions 397 (AM), 395 (GI), 397 (SY), 398 (JM), 397 (SH) and 399 (HE): rows not legible in the source scan.] 

FIG. 5C 



[Alignment of the EPO genomic sequences AM, GI, SY, JM, SH and HE, continued: four blocks ending at AM positions 447, 497, 547 and 597. The aligned rows are not legible in the source scan.] 

FIG. 5D 
[Alignment of the EPO genomic sequences AM, GI, SY, JM, SH and HE, continued: three blocks ending at AM positions 647, 697 and 747. The aligned rows are not legible in the source scan.] 

AM  TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT  797 
GI  TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT  795 
SY  TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT  797 
JM  TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT  798 
SH  TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT  795 
HE  TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT  799 
    ************************************************** 

FIG. 5E 



[Alignment of the EPO genomic sequences AM, GI, SY, JM, SH and HE, continued: four blocks ending at AM positions 847, 897, 947 and 997. The aligned rows are not legible in the source scan.] 

FIG. 5F 
[Alignment of the EPO genomic sequences AM, GI, SY, JM, SH and HE, continued: four blocks ending at AM positions 1047, 1097, 1147 and 1197. The aligned rows are not legible in the source scan.] 

FIG. 5G 
[Alignment of the EPO genomic sequences AM, GI, SY, JM, SH and HE, continued: four blocks ending at AM positions 1247, 1297, 1347 and 1397. The aligned rows are not legible in the source scan.] 

FIG. 5H 



[Alignment of the EPO genomic sequences AM, GI, SY, JM, SH and HE, concluded: three blocks ending at AM positions 1447, 1497 and 1547, and a final block for which only the end positions 1582, 1585 and 1583 are legible. The aligned rows are not legible in the source scan.] 

FIG. 6 



[Alignment of the EPO amino acid sequences AM/GI, SY, JM, SH and HE. Blocks ending at positions 50 and 100: rows not legible in the source scan.] 

AM/GI  VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD  150 
SY     VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD  150 
JM     VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD  150 
SH     VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD  150 
HE     VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD  150 
       ************************************************** 

[Block ending at position 193: rows not legible in the source scan.] 

FIG. 7 

FIG. 8 

FIG. 9 

FIG. 10 